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NOTICE 

The invention disclosed in this document resulted from 
research in aeronautical and space activities performed under 
programs of the National Aeronautics and Space Administration. 
The invention is owned by NASA and is, therefore, available for 
licensing in accordance with the NASA Patent Licensing 
Regulation (14 Code of Federal Regulations 1245.2). 

To encourage commercial utilization of NASA-owned inventions, 
it is NASA policy to grant licenses to commercial concerns. 
Although NASA encourages nonexclusive licensing to promote 
competition and achieve the widest possible utilization, NASA 
will consider the granting of a limited exclusive license, 
pursuant to the NASA Patent Licensing Regulations, when such a 
license will provide the necessary incentive to the licensee to 
achieve early practical application of the invention. 

Address inquiries and all applications for license for this 
invention to NASA Resident Office-JPL, NASA Patent Counsel, 

Mail Code 180-801, 4800 Oak Grove Dr., Pasadena, CA 91103. 
Approved NASA forms for application for nonexclusive or 
exclusive license are available from the above address. 
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PATENT APPLICATION 


The invention described herein was made in the 
performance of the work under a NASA Contract and is 
subject to the provisions of Public Law 96517 (35 USC 
202) in which the contractor has elected not to retain 
title . 


This invention relates to control systems for 
controlling robotic manipulators. 


The next generation of robotic manipulators 
will perform high-precision tasks in partially unknown 
and unstructured environments. These tasks require 
precise motion control of the manipulator under unknown 
and varying payloads. These requirements are far beyond 
the capabilities of present-day industrial robot 
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controllers, and demand robust high-performance 
manipulator control systems. The need for advanced 
manipulator control systems to accomplish accurate 
trajectory tracking has therefore been recognized for 
some time, and two parallel lines of research have been 
pursued. The primary outcome of such research is the 
development of two classes of advanced manipulator 
control schemes, namely model-based and performance- 
based techniques. 

Model-based techniques, such as the Computed Torque 
Method by B. R. Markiewicz: Analysis Of The Computed 
Torque Drive Method And Comparison With Con vent ional 
Position Servo For A Computer-Controlled Manipulator , 
Technical Memorandum 33-601, Jet Propulsion Laboratory, 
1973, are based on cancellation of the nonlinear terms 
in the manipulator dynamic model by the controller. 
This cancellation is contingent on two assumptions which 
are not often readily met in practice. First, the 
values of all parameters appearing in the manipulator 
dynamic model, such as payload mass and friction 
coefficients, must be known accurately. Second, the 
full dynamic model of the manipulator needs to be known 
and computed on-line in real-time at the servo control 
rate. Performance-based techniques, such as the direct 
adaptive control method by S. Dubowsky and D. T. 
DesForges : The Application Qf Model-Refer enced Adaptive 
Control To Robot ic Manipulators. ASME Journal of Dynamic 
Systems, Measurement and Control, Vol. 101, pp. 193-200, 
1979, attempt to overcome these limitations by adjusting 
the controller gains on-line in real-time, based on the 
tracking performance of the manipulator; and, thus, 
eliminating the need for the manipulator model. 
Therefore, the identification of the manipulator and 
payload parameters or the complex manipulator dynamic 
model is not necessary, and hence a fast adaptation can 
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1 be achieved. Adaptive control methods, however, may 

2 become unstable for high adaptation rates and treat the 

3 manipulator as a "black-box" by not utilizing any part 

4 of the manipulator dynamics in the control law 

5 formulation. 

6 During the past few years, several attempts have 

7 been made to combine the model-based and performance- 

8 based techniques in order to take full advantage of the 

9 merits of both techniques and overcome their 

10 limitations. For instance, the approach of J. J. Craig, 

11 P. Hsu, and S. S. Sastry: Adaptive Control Of 

12 Mechanic al Manipulators. Proc. IEEE Intern. Conf . on 

13 Robotics and Automation, Vol. 1, pp. 190-195, San 

14 Francisco, 1986, and R. H. Middleton and G. C. Goodwin: 


16 Manipulators . Proc. IEEE Conf. on Decision and Control, 

17 vol. 1, pp. 68-73, Athens, 1986, the manipulator 

18 parameters are estimated adaptively first and are then 

19 utilized in a dynamic-based control law. 

20 A search was conducted in the following classes and 

21 subclasses. 


CLASS 


561, 567, 568, 569, 599, 600, 601 
616, 617, and 685, 561, 604, 618 and 


26 364 133, 134, 148, 150, 157, 162, 165, 

27 183, 193, 478 and 513 

28 901 14, 15 and 19 

29 The results of the search include the following 

30 patents: 

31 Oswald 4,200,827 

32 Penkar et al. 4,773,025 

33 Axelby et al. 4,663,703 

34 Takahashi et al. 4,639,652 

35 Shigemasa 4,719,561 
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Browder 4,341,986 
Hafner et al. 4,546,426 
Horak 4,547,858 
Perzley 4,603,284 
Perreirra et al . 4,763,276 
Littman et al. 3,758,762 
Hiroi et al. 4,563,735 
Matsumura et al . 4,670,843 
Shigemasa 4,679,136 

First attention is directed to Oswald 4,200,827 
which discloses a control system for a magnetic head 
including both velocity and position feedback and 
feedforward signals representing both velocity and 
acceleration. See Figure 1 and column 3, line 17 to 
column 6, line 35. Also, see Penkar et al. 4,773,025 
and Axelby 4,663,703. 

Next attention is directed to Takahashi et al . 
4,639,652 which discloses a control system for a robot 
manipulator including adaptive position and velocity 
feedback gains. See gain adjuster 14 and gains 5 and 6 
in Figure 1. Particular attention should be given to 
the circuit diagram presented in Figure 4 of this 
reference which differs from Figure 1 only in the use of 
the transfer function T(s) and operates in accordance 
with the description beginning at column 4, line 48 
through column 6, line 29. This reference is of 
interest only because of the high speed positioning 
control. It relies upon a prior art technique commonly 
known as "Identification”, in that test runs permit the 
gains of its control law to be identified. 

Next attention is directed to Shigemasa 4,719,561 
which discloses a control system having robust 
controller 24 in combination with a PID controller 22. 
See Figure 3 and column 5, line 13. 

The following references all disclose a control 


1 1 

5 

system for a robot manipulator. 

2 

Browder 4,341,986 

3 

Hafner 4,546,426 

4 

Horak 4,547,858 

5 

Perzley 4,603,284 

6 

Perreirra et al. 4,763,276 

7 

The following references are cited as of interest. 

8 

Littman et al. 3,758,762 

9 

Hiroi et al. 4,563,735 

10 

Matsumura et al . 4, 670,843 

11 

Shigemasa 4,679,136 

12 


13 

In contrast to the Computed Torque Method of the 

14 

prior art, the invention does not rely on an accurate 

15 

dynamic model in order to control the manipulator. 

16 

Furthermore, global asymptotic stability of the control 

17 

system is assured since the feedback adaptation laws are 

18 

derived from a Lyapunov analysis, and the feedforward 

19 

controller is outside the servo control loop. 

20 


21 

A new, robust control system using a known part of 

22 

the manipulator's dynamics in a feedforward control 

23 

circuit and any unknown dynamics and uncertainties 

24 

and/or variations in the manipulator/payload parameters 

25 

is accounted for in an adaptive feedback control loop. 

26 


27 

SUMMARY OF THE INVENTION 

28 


29 

This invention discloses and claims a novel 

30 

approach of combining model-based and performance-based 

31 

control techniques. Two distinct and separate design 

32 

philosophies have been merged into one novel control 

33 

system. The invention's control law formulation is 

34 

comprised of two distinct and separate components, each 

35 

of which yields a respective signal component that is 
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6 

combined into a total command signal for the system. 

Those two separate system components include a 
feedforward controller and a feedback controller. The 
feedforward controller is model-based and contains any 
known part of the manipulator dynamics that can be used 
for on-line control to produce a nominal feedforward 
component of the system's command signal. The feedback 
controller is performance-based and consists of a simple 
adaptive PID controller which generates an adaptive 
control signal to complement the nominal feedforward 
signal. The feedback adaptation laws are very simple, 
allowing a fast servo control loop implementation. 

BRIEF SUMMARY OF THE FIGURES QF THE DRAWING 

Figure 1 is a figure depicting a schematic diagram 
of a typical actuator and link assembly in accordance 
with the invention; 

Figure 2 is a figure depicting a 
feedforward/feedback tracking control scheme in 
accordance with the invention; 

Figure 3 is a figure depicting a two-link planar 
manipulator in a vertical plane in accordance with the 
invention; 

Figure 4(i) is a figure depicting the desired 
[dashed] and actual [solid] trajectories of the joint 
angle 6 (t) in accordance with the invention; 

Figure 4 (ii) is a figure depicting the desired 
[dashed] and actual [solid] trajectories of the joint 
angle 0 2 (t) in accordance with the invention; 

Figure 5(i) is a figure depicting the variation of 
the tracking - error e 2 <t) in accordance with the 
invention; 

Figure 5(ii) is a figure depicting the variation of 
the tracking - error ei(t) in accordance with the 
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invention; 

Figure 6(i) is a figure depicting the variation of 
the control torque Ti(t) in accordance with the 
invention; 

Figure 6(ii) is a figure depicting the variation of 
the control torque T 2 (t) in accordance with the 
invention; 

Figure 7(i) is a figure depicting the variations of 
the auxiliary signals fi(t) [solid] and f 2 (t) [dashed] 
in accordance with the invention; 

Figure 7 (ii) is a figure depicting the variations 
of the position gains k p i(t) [solid] and k p 2 (t) [dashed] 
in accordance with the invention; 

Figure 7(iii) is a figure depicting the variations 
of the velocity gains kvMt) [solid] and k v 2 (t) [dashed] 
in accordance with the invention; 

Figure 8 is a figure depicting the functional 
diagram of the testbed facility in accordance with the 
invention; 

Figure 9(i) is a figure depicting the desired 
[dashed] and actual [solid] PUMA waist angles under 
adaptive controller in accordance with the invention; 

Figure 9(ii) is a figure depicting the waist 
tracking-error under adaptive controller in accordance 
with the invention; 

Figure 10 (i) is a figure depicting the desired 
[dashed] and actual [solid] PUMA waist angles under 
unimation controller in accordance with the invention; 

Figure 10 (ii) is a figure depicting the waist 
tracking-error under unimation controller in accordance 
with the invention; 

Figure 11 (i) is a figure depicting the variation of 
the auxiliary signal f(t) in accordance with the invention 
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Figure 11 (ii) is a figure depicting the variation 
of the position gain k p (t) in accordance with the 
invention; 

Figure ll(iii) is a figure depicting the variation 
of the velocity gain k v (t) in accordance with the 
invention; 

Figure 11 (iv) is a figure depicting the variation 
of the control torque T(t) in accordance with the 
invention; 

Figure 12 (i) is a figure depicting the desired 
[dashed] and actual [solid] waist angles with arm 
configuration change in accordance with the invention; 
and 

Figure 12 (ii) is a figure depicting the waist 
tracking-error with arm configuration change in 
accordance with the invention. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 
1. SUMMARY OF PRESENTATION. 

The presentation of the invention is structured as 
follows. In Section 2, the integrated dynamic model of 
a manipulator and actuator system is derived. The 
tracking control scheme is described fully in Section 3. 
In Section 4, the digital control implementation of the 
scheme is given. The issue of robustness is discussed 
in Section 5. The control scheme is applied in Section 
6 to the model of a two-link arm, and extensive 
simulation results are given to support the method. In 
Section 7, the implementation of the proposed control 
scheme on a PUMA industrial robot is described and 
experimental results are presented to validate the 
improved performance of the invention. Section 8 
discusses the results and concludes the presentation of 
the description of the invention. 

2 . INTEGRATED DYNAMIC MODEL OF MANIPULATOR-PLUS- 
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ACTUATOR SYSTEM 

Most papers on manipulator control neglect the 
dynamics of joint actuators, and treat the joint torques 
as the driving signals. In this section, I take a 
realistic approach by including the actuator dynamics 
and modeling the manipulator and actuators as an 
integrated system. 

In many industrial robots such as the Unimation 
PUMA, the links of the manipulator are driven by 
electric actuators at the corresponding joints, and the 
dynamics of the joint actuators must be taken into 
account. Note that although electric actuators are 
modeled hereinafter, the results are general since the 
form of dynamic equations for other types of actuators 
is essentially the same. 

Referring to Figure 1, each actuator 100 may be 
considered as comprising a link 101, driven by a gear 
110 that meshes with a motor-driven drive gear 125. Many 
such actuators are generally required for any given 
robotics application. A single actuator as a general 
case will be presented in this application for 
simplicity purposes. It should be understood that 
several actuators as needed are driven by the command 
signal as developed by the control system of this 
invention . 

Any typical actuator is basically a DC servomotor 
with a permanent magnet 130 to provide the motor field 
and the driving signal is a voltage or a current applied 
to the armature winding. In Figure 1, a driving voltage 
identified simply as Vj is impressed across a pair of 
input terminals 140. The resistance and inductance 
shown in the Figure simply represent the internal 
parameters typically found in any actuator and such 
matters are well known in the art and require no further 
description. Since servomotors are inherently high-speed 
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1 low-torque devices, the gear assembly 110,125 is often 

2 required to mechanically couple the armature shaft 126 

3 to the robot link 110 in order to obtain speed reduction 

4 and torque magnification. 

Consider now the jth actuator 100 and suppose that 

6 the armature is voltage-driven, as shown in Figure 1. 

7 This representation is general because in cases where 

8 the armature is current-driven using the current source 

9 i(t) with shunt resistance R c/ the driving source can be 
10 replaced by the voltage source v(t) = R c i(t) with series 


11 

12 


resistance R v -~z~. Therefore, without loss of 


13 generality, we can assume that the driving source of the 

14 jth joint motor is always the voltage source Vj(t) with 

15 the internal resistance rj. This source produces the 

16 current ij (t) in the armature circuit; and the 

17 electrical equation for the jth actuator can be written 

18 as 


19 

20 


V t)= r , V £) + V, (t)+ V3£ [ V t)I+ 


( 1 ) 


where Rj and Lj are the resistance and inductance of the 
22 jth armature winding, d (t) is the angular displacement 


23 

24 

25 

26 

27 

28 
29 


of the jth armature shaft, and the term K 


d6 } (t) 


bj 


dt 


is 


due to the back-emf generated in the armature circuit . 
Let us now consider the mechanical equation of the 
actuator. Referring to the armature shaft, the 
"equivalent" moment of inertia and friction coefficient 
of the total load are given by K. Ogata: Modern Control 

30 [[ Engineering . Prentice Hall Inc., N.J., 1970 
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34 
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f * + 



ijL ^ 


+ (N^ 



(3) 


where {Jj m ,fj m } and {Jj , fj^} are the moments of inertia 
and friction coefficients of the jth motor shaft and the 
jth robot link respectively, while Nj m and Nj are the 
numbers of gear teeth on the motor side and on the link 

N 

side respectively, and N = — — <1 is the gear ratio. 

3 N j^ 

Although it is assumed that there is one gear mesh 
between the motor and the link, the result can be 
extended to multi-mesh gear trains in a trivial manner. 
See K. Ogata: Modern Control Engineering . Prentice Hall 

Inc., N.J., 1970. Equations (2) and (3) indicate that, 
as seen by the motor shaft, the link inertia and 
friction are reduced by a factor of (Nj)2. Now, the 
torque r^(t) generated by the jth servomotor is 
proportional to the armature current ij(t); that is, 
r^ (t) = i (t) , and will cause rotation of the armature 
shaft by <zS ^ (t.) . In addition, the armature will exert an 
"effective" torque Tj(t) on the jth robot link through 
the gear train. Thus, the mechanical equation for the 
jth actuator can be expressed as (refer to K. Ogata, 


supra) 


d <t) 

~ +f i 

dt 3 


d«5 5 (t) 
dt 


+ N j T j (t) = rj (t) 


(4) 


Let us now denote the angular displacement of the jth 


robot joint by 0^(t) , where 
e^t) =N j «$ j (t) 


( 5 ) 


due to the gear train. Then, (1) and (4) can be written 
in terms of 0 (t) as 
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di (t) . d0 (t) 

V j (t)= (r j + R j )i j (t) +1^ + N^ K bj dt 


J d 0 (t) f d0 (t) 

r (t ) = K i (t) = “ — + +N T (t) 

j aJ 3 N 5 dt Nj dt 33 


(6) 


(7) 


Eliminating the armature current ij (t) between (6) and 
(7) , the dynamic model of the jth actuator can be 
described by the third order differential equation 


rv.i 


[<r, +R ,)J J + L, f ,1 

I 

* 

Q» 

i-k 

S3 

3 + 

dt 

L k .i n j J 


"dQ 


j 


( 8 ) 


|d0 r LjN id Tj 

dt L 


K a j N J 

Consequently, for an n-jointed robot, the n joint 
actuators as a whole can be represented by the (3n)th 
order vector differential equation 


A0(t) + B0(t) +C0(t) + DT(t) + ET (t) = V(t) 


(9) 


where V(t), 0(t) and T(t) are n x 1 vectors and the n x 
n diagonal matrices in (9) are defined by 

■ ».-[^*] - -.-m 

In a typical DC servomotor, the inductance of the 
armature winding is in the order of tenths of 
millihenries, while its resistance is in the order of a 
few ohms. Refer to J. Y. S. Luh: Conventional 

Controller Design For Industrial Robots — A Tutorial. 
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IEEE Trans. Systems, Man and Cybernetics, SMC 13(3), pp. 
298-316, 1983. Thus, the inductances Lj can safely be 

neglected an< j t ki s case the actuator model 

(9) reduces (2n)th order model 


G0(t) + C0(t) + ET(t) = V(t) 


where 


G « = 


K a^ . 


It 


( 10 ) 


is seen that the 


approximation 1^*0 has resulted in A = D = 0, hence a 
decrease in the order of the model from 3n to 2n. 

Now that the joint actuators have been modeled, we 
shall consider the manipulator dynamics. In general, 
the dynamic model of an n- jointed manipulator which 
relates the n x 1 "effective" joint torque vector T(t) 
to the n x 1 joint angle vector 0(t) can be written as 
(See J. J. Craig: Robotics — Mechanics and Control . 
Addison Wesley Publishing Company, Reading, MA, 1986) . 


M* (m, 0)0+ N* (m, 0,0) =T 


( 11 ) 


where m is the payload mass, M*(m, 0) is the symmetric 
positive-definite n x n inertia matrix, N*(m, 0,0) is the 
n x 1 vector representing the total torque due to 
Coriolis and centrifugal term, gravity loading term, and 
frictional term. The elements of M* and N* are highly 
complex nonlinear functions which depend on the 

manipulator configuration 0, the speed of motion 0, and 
the payload mass m. On combining (10) and (11), we 
obtain the integrated dynamic model of the manipulator- 
plus-actuator system as 


M(m,0)0 + N(m,0,0) =V 


( 12 ) 
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where the terms in (12) are defined as 

M(m,0) = G + EM* (m/0) ; N(m,0, 0) = C0 + EN* (m,0,0) 

Equation (12) represents a (2n)th order coupled 
nonlinear system with the n x 1 input vector V(t) of the 
armature voltages and the n x 1 output vector ®(t) of 
the joint angles. 

Although the manipulator dynamic model (11) and the 
integrated system model (12) are both of order (2n) , it 
is important to note that the integrated model (12) is a 
more accurate representation of the system than the 
manipulator model (11), which does not include the 
actuator dynamics. This is due to the following 
considerations : 

(i) Electrical Parameters: The main contribution from 
the electrical part of the joint actuators to the 
integrated system dynamics is the back-emf term k 0(t) 

b 

in (6) . This term can have a significant effect on the 
robot performance when the speed of motion 0(t) is high. 
Note that the back-emf appears as an internal damping 
term, contributing to the coefficient of 0(t) in (12) . 
The other electrical parameter is the armature 
resistance R which appears in (6) and converts the 
applied armature voltage v(t) to the current i(t) and in 
turn to the driving torque T(t). 

(ii) Mechanical Parameters. The major contribution from 
the mechanical part of the joint actuators to the robot 
performance is due to the gear ratios N^[< l] of the gear 
trains coupling the motor shafts to the robot links. As 
seen from (4), the "effective" driving torque on the jth 
link is reduced by a factor of Nj as seen by the jth 
joint actuator. In addition, from (2) and (3), the 
moments of inertia and friction coefficients of the jth 
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link are also reduced by a factor of (Nj ) 2 as seen by 
the motor shaft. This implies that the mechanical 
parameters of joint motors, namely the motor shaft 
inertia and friction, can have a significant effect on 
the overall system performance; particularly in robots 
with large gear ratios such as PUMA 560 where Nj is 
typically 1:100. 

3. TRACKING CONTROL SCHEME 

Given the integrated dynamic model of the 
manipulator-plus-actuator system as 

M(m,0)0 + N(m,0,0) =V (13) 

The tracking control problem is to devise a control 
system which generates the appropriate armature voltages 

V (t ) so as to ensure that the joint angles 0(t) follow 

any specified reference trajectories 0 r (t) as closely as 

possible, where 0 r (t) is an n x 1 vector. 

The intuitive solution to the tracking control 
problem is to employ the full dynamic model (13) in the 
control scheme in order to cancel out the nonlinear 
terms in (13) . This approach is commonly known as the 
Computed Torque Technique (see B. R. Markiewicz, supra) « 
and yields the control law 


V = M(m,0)[0 r + K v (0 r -0) +K p (0 r — 0)] + N(m, 0,0) (14) 

where K p and K v are constant diagonal n x n position 
and velocity feedback gain matrices. This results in 
the error differential equation 


e(t) + K v e(t) +K p e(t) =0 


(15) 
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where e(t) = 0 r (t) - 0(t) is the n x 1 vector of position 
tracking-errors. When the diagonal elements of K p and 
K v are positive, (15) is stable; implying that 
e(t) -* 0 or 0(t) — »0 r (t) as t-*~, i.e. tracking is 
achieved. In the control law (14), we have implicitly 
made a few assumptions which are rarely true in 
practice. The major problem in implementing (14) is 
that the values of the parameters in the manipulator 
model (13) are often not known accurately. This is 
particularly true of the friction term and the payload 
mass. Another problem in implementation of (14) is that 
the entire dynamic model (13) of the manipulator must be 
computed on-line in real time. These computations are 
quite involved, and the computer expense may make the 
scheme economically unfeasible. 

In an attempt to overcome the afore-mentioned 
limitations of the Computed Torque Technique, a new 
tracking control philosophy of the invention is proposed 
in this section. The underlying concept in this 
invention is that the full dynamic model is not required 
in order to achieve trajectory tracking and the lack of 
knowledge of full dynamics can readily be compensated 
for by the introduction of "adaptive elements" in the 
control system. Specifically, the proposed tracking 
control system. Figure 2, is composed of two components: 
the nominal feedforward controller 220 and the adaptive 
feedback controller 250. In Figure 2, the block 235 
represents the manipulator plus actuators of the type 
generalized earlier herein. The output signals on leads 
240 and 245 are the actual velocity and actual position 
of the system as sensed in any well known manner. Three 
separate input terms are depicted at leads 201, 202, and 
203. The signals on these input leads represent, 
respectively, desired position ( 0 r ) on lead 201, desired 
velocity (0 r ) on lead 202, and desired acceleration (0 r ) 
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on lead 203. The feedforward controller 220 contains 
computation elements 205 and 210. Computation elements 
205 and 210 take the known information that is available 
about the manipulator/system and compute, based upon the 
input signals at leads 201, 202, or 203 the available 
partial information that is fed to a summing junction 
215. The output from summing junction 215 is a signal 
V 0 , which signal is in turn fed into another signal 
junction 230. The signal V 0 from junction 215 is the 
feedforward component of the total command signal V that 
is developed by the invention at lead 232 into the 
manipulator plus actuators 235. Since the feedforward 
loop 220 is model-based, any known information about the 
manipulators or the actuator system is input into the 
control loop. Data on the manipulator dynamics can be 
used for real-time control at the required sampling 
rate. Such information can be, for instance, only the 
gravity loading term or the manipulator full dynamics 
excluding the payload. The feedforward controller 220 
is model-based and it acts on the desired joint 
trajectory 0 r (t) to produce the actuators driving 
voltage V 0 (t) . 

The role of the feedback controller 250 is to 
generate the corrective actuator voltage V a (t) , based on 
the tracking-error e(t), that needs to be added to V 0 (t) 
to complement the feedforward controller. The feedback 
controller 250 is composed of adaptive position and 
velocity feedback terms and an auxiliary signal f(t). 
The feedback gains, K v (t) at element 240 and K p (t) at 
element 242 are varied in accordance with an adaptation 
law that is described in greater detail hereinafter. 

Suffice it to say at this point that the error 
terms and the desired terms for position and velocity 
are adapted to form an adaptation component, V a , at 
junction 230 which is then combined with the feedforward 
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component V Q from loop 220 in order to yield the total 
command signal for the control system of this invention. 
Thus, the feedback loop 250 employs these stated signals 
which are updated continuously in real time to cope 
with the nonlinear nature of the system and 
uncertainties/variations in the manipulator parameters 
or payload. The feedforward and feedback controllers 
220, 250 are now discussed separately in Sections 3.1 
and 3.2. 

3.1 NOMINAL FEEDFORWARD CONTROLLER 

Suppose that some partial knowledge about the 
manipulator dynamic model (13) is available in the form 

of "approximations" to {M(m,0), N(m,0, 0)} denoted by 

{M o (m o ,0 r ),N o (m o ,0 r , 0 r ) , where m 0 is an estimate of m. 

Note that {M^NJ are functions of the reference 
trajectory 0 r (t) instead of the actual trajectory 0(t) . 
The information available in Mq and N 0 can vary widely 
depending on the particular situation. For instance, we 

can have M o (m o ,0 r )=O and N o (m o ,0 r 0 r ) = G(m,0 r ) f 

where only gravity information is available. Likewise, 
it is possible to have M o (m o ,0 r ) =M(0, 0 r ) and 
No( m o,® r , 0 r ) = N (O,0 r ,0 r ) ' 

where no information about the payload is available. 
Furthermore, the matrices Mo and N 0 may have either a 
"centralized" or a "decentralized" structure. In the 
centralized case, each element of M 0 and N 0 can be a 
function of all joint variables. In the "decentralized" 
case, [M 0 ] ii and [N 0 ]i are functions only of the ith 
joint variable and [Molij = 0 for all i^j. 

The nominal feedforward controller is described by 

V„(t) = M o (m o ,0 r )0 r (t) +N o (m o ,0 r ,0 r ) 


(16) 
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where V 0 is the n x 1 nominal control voltage vector and 
the controller operates on the reference trajectory 
0 r (t) instead of the actual trajectory 0(t) . It is 
important to realize that M 0 and N 0 are based solely on 
the information available on the manipulator dynamics 
which is used in the real-time control system of this 
invention. The controller matrices {M 0 , N 0 } can 

therefore be largely different from the model matrices 
{M,N} due to lack of complete information, or due to 
computational constraints. For instance, in some cases 
we may wish to discard some elements of M and N in order 
to reduce the on-line computational burden, even if the 
full knowledge of manipulator dynamics is available . 

3.2 ADAPTIVE FEEDBACK CONTROLLER 

In contrast to the feedforward controller 220, 
Figure 2, the feedback controller 250 does not assume 
any a priori knowledge of the dynamic model or parameter 
values of the manipulator plus actuators 235. This 
controller 250 operates solely on the basis of the 
tracking performance of the manipulator through the 
tracking-error e(t). The controller 250 is adaptive and 
its gains are adjusted continuously in real-time by 
simple adaptation laws to ensure closed-loop stability 
and desired tracking performance. The on-line 
adaptation compensates for the changing dynamic 
characteristics of the manipulator due to variations in 
its configuration, speed, and payload. 

The adaptive feedback controller is described by 

V a (t) = f(t) + K p (t)e(t) + K v (t)e(t) (17) 

where V a (t) is the n x 1 adaptive control voltage vector, 
e(t) = 0 r (t) - 0(t) is the n x 1 position tracking-error 
vector, f(t) is an n x 1 auxiliary signal generated by 
the adaptation scheme, and (K p (t),K v (t) } are the n x n 
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adjustable PD feedback gain matrices. The feedback 
control law (17) can be either "centralized" or 
"decentralized." For the centralized case, the 
controller adaptation laws are obtained as I have 
discussed in my paper entitled A New Approach to 

Adaptive Control of Manipulators, ASME Journal of 

Dynamic Systems, Measurement, and Control, Vol . 109, No. 
3, pp. 193-202, 1987. For the decentralized control 

case, the gains (K p (t),K v (t)} are diagonal matrices and 
their ith diagonal elements are obtained from the 
adaptation laws (21) - (24) with e(t) replaced by ei(t). 
See, for further explanation my paper H. Seraji: 

Decentralized Adaptive Control o f Manipulators: Theory, 

Simulation. and Experimentation . IEEE Journal of 
Robotics and Automation, 1988, (to appear) . The 
centralized case yields the controller adaptation laws 
as 

f(t) = Y x r(t) +y 2 r(t) (18) 

K p (t) = -^-[r (t) e# (t)] +a 2 [r(t)e'(t)] (19) 

K v (t) = p^trWeft)] + p 2 [r(t)e (t)] (20) 


where the prime denotes transposition, and r(t) is the n 
x 1 vector of "weighted" position-velocity error defined 
as 

r(t) = W p e(t) +W v e(t) (21) 

In (18) -(21), (Y^O^/Pj) are zero or positive 

proportional adaptation gains, (Y 2 ,<* 2 ,P 2 ) are positive 
integral adaptation gains, and W p = diagi(w pi ) and 
W v = diag (w ) are constant n x n matrices which contain 
the position and velocity weighting factors for all 
joints. Integrating (18) - (20) in the time interval 
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[0,t], one obtains 

f(t) - f(0) = Yjrtt) - r(0)] + y 2 / o t r(t)dt 

K p (t) -K p (0) = ajrftje'tt) - r(0>e'<0>] + (t)e'(t)dt 

K v (t) - K v (0) = Pjr^e'tt) - r(0)e'(0)] + P^rfcje (t)dt 


Since the initial values of the reference and actual 
trajectories are the same, we have e(0) = e(0) = r(0) = 0 . 
This yields the Proportional + Integral (P+I) adaptation 
laws 


f(t) = f(0) + y i r(t) + Y 2 J Q t r(t)dt 

(22) 

K D (t) =K(0) + a r(t)e'(t) +aj t r(t)e'(t)dt 

P P 1 2 0 

(23) 

K v (t) =K V (0) + p ir (t)e'(t) +p 2 J o t r(t)e'(t)dt 

(24) 


It is noted that the choice of {W p ,W v } affects all 
adaptation rates in (22) - (24) simultaneously; whereas 
the adaptation rate for each term {f(t),K p (t),K v (t)} can be 
affected individually be the selection of (Y^Ct^P^ 
independently. The proportional terms in the adaptation 
laws (22) - (24) act to increase the rate of convergence 
of the tracking-error e(t) to zero. The use of P+I 
adaptation laws also yields increased flexibility in the 
design, in accordance with the features of the 
invention, by providing a larger family of adaptation 
schemes than obtained by the conventional I adaptation 
laws . 

The physical interpretation of the auxiliary signal 
is obtained by substituting from (21) into (22) to yield 

f(t) = f(0) + YjWpeft) +W v e(t)] + y 2 J o t [W p e(t) + W v e(t)]dt 

= f(0) + [y i w p + y 2 w v ]e(t) + [Y 2 W p ]fe(t)dt+ CY 1 W v fe(t) 


( 25 ) 
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Hence, f(t) can be generated by a PID controller with 
fixed gains acting on the tracking-error e(t). Thus, 
the feedback controller (17) can be represented by the 
PID control law 

V B (t) = f(0) + K* p (t)e(t) + K* v * (t)e(t) +K*/ o t e(t)dt (26) 

where 

K* p (t) =K p (t) +Y 1 W p + Y 2 W v / K v (t) = K v (t) + Y : W V (27) 

; k; = y 2 w p 

It is seen that the feedback controller 250 as defined 
in accordance with equation (26) is composed of three 
terms which are effective during the initial, 
intermediate, and final phases of motion: 

(i) The initial auxiliary signal f(0) can be chosen to 
overcome the stiction (static friction) and compensate 
for the initial gravity loading. This term improves the 
responses of the joint angles during the initial phase 
of motion. 

(ii) The adaptive-gain PD term K p (t)e(t) +K*„(t)e(t) is 
responsible for the tracking performance during gross 
motion while the manipulator model is highly nonlinear, 
i.e., the changes of 0(t) and 0(t) are large. Each gain 

consists of a fixed part and an adaptive part . The on- 
line gain adaptation is necessary in order to compensate 
for the changing dynamics during the intermediate phase 
of motion. 

(iii) The fixed-gain I term K*/ t e(t)dt takes care of the 

I 0 

fine motion in the steady-state, while the changes of 
0(t) and 0(t) are small and the manipulator model is 
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approximately linear. Thus the I term contributes 
during the final phase of motion. 

3.3 TOTAL CONTROL SYSTEM OF THE INVENTION 

The total control system is obtained by combining 
the nominal feedforward controller 220 operating in 
accordance with equation (16) and the adaptive feedback 
controller 250 operating in accordance with equation 
(17) as shown in Figure 2 to yield the control law 
V(t) = V 0 (t) + V 0 (t) (28) 

= [tf o (m o ,0 r )0 r + N o <m o ,0 r ,0 r )] + [f(t) + K p (t)e(t) + K w (t)e(t)] 

where V is the total voltage applied at the actuators. 
It is important to note that in this control 
configuration, closed-loop stability is not affected by 
the feedforward controller. 

I shall now discuss two extreme cases : 

Case i: UNKNOWN MANIPULATOR MODEL 

When no a priori information is available on the 
manipulator dynamic model, that is, M 0 = N 0 = 0, the 
feedforward controller has no contribution, i.e. V 0 (t) = 
0. In this case, the control system approach reduces to 
the adaptive feedback control law 

V(t) = f(t) + K p (t)e(t) +K v (t)e(t) (29) 

which can be implemented with a high sampling rate. 

Case ii: FULL MANIPULATOR MODEL 

When the full dynamic model and accurate parameter 
values of the manipulator, actuators and payload are 
available for on-line control, that is M 0 = M and N c = 
N, the feedforward controller can generate the required 
actuator voltage V 0 (t). In this case, the adaptation 
process can be switched off and the feedback controller 
reduces to a fixed-gain PD controller {K p (0),V„ v(0) } . 
The control law is now given by 
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V(t) = M(m,0 r )0 r + N (m,0 r , 0 r ) + K p (0)e(t) + K v (0)e(t) (30) 

which is the feedforward version of the Computed Torque 

Technique. See, for example, P.K.Khosla and T. Kanade: 

Experimental Evaluat ion of Nonlinear EeedJbacJs and 

Feedforward Control Sch emes For Manipulators. Intern. 
Journ. of Robotics Research, Vol. 7, No. 1, pp. 18-28, 
1988. 

I have concluded that there is a trade-off between 
the availability of a full dynamic model and the 
controller adaptation process. In the range of possible 
operation, one can go from one extreme of no model 
knowledge and fully adaptive controller, to the other 
extreme of full model knowledge and non-adaptive 
controller . 

4. DIGITAL CONTROL ALGORITHM 

In Section 3, it is assumed that the control action 
is generated and applied to the manipulator in 
continuous time. In practical implementations, however, 
manipulators are controlled by means of digital 
computers in discrete time. In other words, the 
computer receives the measured data (joint positions 0) 
and transmits the control signal (actuator voltages V) 
ever T s seconds, where T s is the sampling period. It is 
therefore necessary to reformulate the manipulator 
control problem in discrete time from the outset. In 
practice, however, the sampling period T s is often 
sufficiently small to allow us to treat the manipulator 
as a continuous system and discretize the continuous 
control law to obtain a digital control algorithm. This 
approach is feasible for the invention, since the on- 
line computations involved for real-time control control 
are very small; allowing high rate sampling to be 
implemented . 

In order to discretize the control law, let us 
consider the adaptation laws (18) - (20) for the 


i 



25 


feedback controller and integrate them in the time 
interval [ (N-1)T S ,NT S ] to obtain 

T, 

f(n) = f(N - 1) +Y i lr(N) -r(N- 1)] + Y a * — tr(N) + r(N — 1)] 

K p (N) = K p (N — 1) +a i [r(N)e'(N) - r(N - l)e'(N - 1)] 

T 

+ a 2 • ~ 2 ~ [r(N)e'(N) + r(N - l)e'(N - 1)] 

K V (N) = K V (N-1) +p i [r(N)e'(N) - r(N - l)e (N - 1)] 

T 

+ P 2 • -y[r(N)e(N) + r(N - l)e'(N - 1)] 

where N and N-l denote the sample instants and refer to 
t = NT S and t = (N-1)T S , e (N) = 0 r (N) - 0(N) is the 

discrete position error, and the integrals are evaluated 
by the trapezoidal rule. The discrete adaptation laws 
can be written as 


r(N) = W p e(N) +W v e(N) 

(31) 

f(N) = f(N - l) +[y 2 - -y -y x }c(n-1) + [y 2 * -y + Y 1 ]r(N) 

(32) 

K p (N)=K p (N-l)+[a 2 * -y- - aJr(N - l)e'(N - 1) 

(33) 

+ [ a 2 • -y + aJr(N)e'(N) 


K V (N) = K W (N - 1) +[p 2 - -y -pJr(N - l)e'(N - 1) 

(34) 

+ [p 2 • -y+pJr(N)4'(N) 


In the above equations, we have assumed 

that the 


discrete velocity error e(N) is directly available using 
a tachometer; otherwise the velocity error must be 

formed in software as e(N) = . Equations 

(31) - (34) constitute the recursive algorithm for 

updating the feed-back controller. 
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Let us now evaluate the number of on-line 
mathematical operations that need to be performed in 
each sampling period T s to form the discrete feedback 
control law 

V„ (N) = f(N) + K p (N)e(N) +K v (N)e(N) (35) 

where e(N) and e(N) are assumed to be available. For a 
centralized feedback controller, the total numbers of 
additions and multiplications in forming V a (N) are 
equal to 6n2 + 3n and 6n 2 + 8n, respectively, where n is 
the number of manipulator joints. For a decentralized 
feedback controller, the numbers of operations are 
reduced to 9n additions and 14n multiplications. The 
small number of mathematical operations, particularly in 
the decentralized case, suggests that we can implement a 
digital servo loop with a high sampling rate, i.e. very 
small T s . This is a very important feature in digital 
control since slow sampling rates degrade the tracking 
performance of the manipulator, and may even lead to 
closed-loop instability. 

Let us now turn to the discrete feedforward control 

law 

V 0 (N) = M o Im of e r (N)]0 r (N) +N o [m o/ 0 r (N),0 r (N)) (36) 

where 0 r (N) and 0 r (N) are directly available from the 

trajectory generator. Since the feedforward controller 
is "outside" the servo loop, it is possible to have a 
fast servo loop around the feedback controller, and the 
feedforward voltage V 0 (N) is then added at a slower 
rate. Furthermore, the feedforward control action V 0 (N) 
is computed as a function of the reference trajectory 
0 r (N) only. In applications where the desired path 
0 r (t) is known in advance, the values of the voltage 
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V 0 (N) can be computed "off-line" before motion begins 
and stored in a look-up table in the computer memory. 
At run time, this precomputed voltage history is then 
simply read out of the look-up table and used in the 
control law. Such an approach can be quite inexpensive 
computationally at run time, while allowing 
implementation of a high servo rate for feedback 
control . 

The total control law in discrete time is given by 
V (N) = V c (N) + V a (N) 

= M o [m o/ e r (N)]0 r (N) + N o (m o ,0 r (N),0 r (N)] < 37 > 

+ f(N) + K p (N)e(N) + K v (N)e(N) 

Equations (31) - (37) constitute the digital control 

algorithm that is implemented for on-line computer 
control of robotic manipulators . 

5. ROBUST ADAPTIVE CONTROL 

The adaptation laws for the feedback controller as 
described in Section 3 are derived under the ideal 
conditions where unmodeled dynamics is not present and 
disturbances do not affect the system. In such 
idealistic conditions, the rate of change of a typical 
feedback gain K(t) which acts on the signal s(t) is 
found to be that set forth below in equation number 
(38) . In expressing this relationship the auxiliary 
signal f (t) , is developed by setting s(t) s 1 . 

K (t) = H i [jr (t) S ' (t) ] -I- |X 2 Cr < t)s ' (tt) 1 (38) 

where and (l 2 are scalar adaptation gains. Extensive 
simulation and experimental studies suggest that too low 
adaptation gains result in smooth variations in K(t), 
but poor tracking performance. On the other hand, too 
high adaptation gains lead to oscillatory and noisy 
behavior of K(t), but yield perfect trajectory tracking. 
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(Note that for both low and high adaptation gains, the 
range of control voltage is more or less the same, since 
it is primarily dependent on the reference trajectory 
and manipulator dynamics) . This argument suggests that 
large adaptations gains are necessary to maintain a high 
speed of controller adaptation in order to ensure rapid 
convergence of the tracking-error e(t) to zero. In 
practice, the adaptation gains cannot be selected too 
large due to a phenomenon known as "fast adaptation 

instability." When the speed of adaptation, i.e. K(t) , 

is too high, the gain K(t) drifts to large values and 
excites the unmodeled dynamics (parasitic) of the 
system, which in turn leads to instability of the 
control system. High speed of adaptation can be either 
due to large adaptation gains or fast reference 
trajectory. Another mechanism for instability can be 
observed in decentralized adaptive control systems . The 
interconnections among subsystems can cause local 
controller parameters drift to large values and hence 
excite the parasitic and lead to instability. For 
instance, a high amplitude or high frequency reference 
trajectory of one subsystem can destabilize the local 
adaptive controller of another subsystem by exciting its 
parasitic through the interconnections. 

We conclude that in an adaptive robot control 
system, unstable behavior can be observed with large 
adaptation rates or high degree of inter joint couplings. 
It is unfortunate that in trying to compensate for the 
change in the system, the adaptive controller may become 
sensitive to its own parameters. 

I shall now discuss two possible approaches for 
avoiding the fast adaptation instability: 

5.1 TIME-VARYING ADAPTATION GAINS 

From equation (38) , it is seen that the speed of 
adaptation depends on the magnitude of the 
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adaptation gains and on the weighted tracking- 

error r(t). Simulation studies with constant adaptation 
gain algorithms suggest that high gains lead to faster 
convergence of the tracking-error to zero. However, in 
the initial phase of adaptation, the weighted tracking- 
error r (t ) is large (e.g. due to static friction) and 
too high a value of adaptation gain causes instability 
problems. As the adaptation process goes on, the term 
r(t) decreases and at this time the adaptation gain is 
increased in order to achieve faster convergence . With 
this motivation, the constant adaptation gains and 
M- 2 in (38) are replaced by positive time-varying gains 
M-jfc) and H 2 (t) without affecting the stability 
analysis. The time functions M- 2 (t) and H 2 (t) start with 
small initial values when the errors are usually large 
and, as time proceeds, build up to appropriate large 
final values when the errors are small . 

5.2 ROBUSTNESS VIA d-MODIFICATION 

The P+I adaptation laws discussed so far have no 
provision for rejecting the destabilizing effect of 
"noise" introduced through unmodeled dynamics or 
disturbances. The integral term in the adaptation law 
acts to integrate a quantity related to the noise term 
squared. The integration of such a non-negative 
quantity inevitably creates an undesirable drift in the 
integral term and ultimately deteriorates the adaptive 
system performance . 

Ioannou and Kokotovic ( Instability Analysis and 

Improvement ol Robustness of Adaptive Control . 

Automatica, Vol. 20, No. 5, pp. 583-594, 1984; Robust 
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Redesign of Adaptive Control . IEEE Trans. Aut . Control, 
Vol. AC-29, No. 3, pp. 202-211, 1984) suggest "O- 
modif ication" to the adaptation law in order to 
eliminate the drift in the integral term and thus 
counteract instability. The basic idea is to modify the 
adaptation law (38) by adding a term — OK(t) which 
removes its purely integral action, that is, instead of 
(38) we use the O -modified law 

K(t) = -OK(t) [r(t)s'(t)J +H 2 [r(t)s'(t)] (39) 

where a is a positive scalar design parameter. The 
size of a reflects our lack of knowledge about the 
unmodeled dynamics and disturbances. In equation (39), 
the leakage or decay term -0K(t) acts to dissipate the 
integral buildup, and eliminate the drift problem which 
excites the parasitic and leads to instability. The 
price paid for the attained robustness is that the 
tracking-error ||r(t)|| now converges to a bounded non-zero 
residual set, and hence perfect trajectory tracking is 
no longer achieved in theory. The size of this residual 
set depends on the value of a, but can often be made 
sufficiently small so that performance degradation is 
acceptable in practice. The drawback of the o-modified 
adaptation law, however, is that in the absence of 
unmodeled dynamics and disturbances, we can no longer 
guarantee that lim J|r(t)|| = 0, unless O = 0. 

The Proportional + Integral + Sigma (P + I + a) 
adaptation laws for the feedback controller in 
continuous time are now given by 

f(t) = f (0) + y i r(t) + y 2 Jjr (t)dt - O^fft) dt (40) 

K p (t) =K p (0) + a 1 r(t)e / (t)+a a / o t r(t)e'(t)dt-G 2 J o t K p (t)dt 


(41) 
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K v (t) = K v (0) + 0 ir (t)e'(t) + p 2 /‘rfcJe'flOdt - a^K^t) dt (42) 
where {<T 1 /0 2 /0 3 } are positive scalar design parameters. 
For digital control implementation with sampling period 
T , (40) - (42) yield the recursive adaptation laws 


f(N) = 


1-a 


1 2 


f(N - 1) + 


K p (N) = 


1+ °ifj 

T 

1 -° 2 T 


1+ ° 2 fj 


K p (N — 1) + 


K V (N) = 


t 

T . 

1 + °3Tj 


K V (N — 1) + 



b(N- 1) + 


2 2 


■*1 


l+O 


1 2 J 


k(N) 


- h:(N - l)e'(N - 1) (43) 


k(N)e'(N) 


hr(N - l)e (N - 1) 


(44) 


(45) 


P 2 T -Pi 


k(N)e(N) 


In conclusion, the use of a -modification is 
essential in obtaining sufficient conditions for 
boundedness in the presence of parasitic. However, in 
the absence of parasitic, o causes a tracking-error of 
0 (Vo) to remain. Therefore, there is a trade-off 
between boundedness of all signals in the presence of 
parasitic and loss of exact convergence of the tracking- 
error to zero in the absence of parasitic. In other 
words, we have sacrificed the performance in an 
idealistic situation in order to achieve robustness in 
realistic situations which are more likely to occur in 
practical applications. 

6 . SIMULATION RESULTS 
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The tracking control scheme developed in Section 3 
has been applied to a two-link manipulator for 
illustration of the benefits of the invention. 

Consider the planar two-link manipulator in a 
vertical plane shown in Figure 3, with the end-effector 
carrying a payload of mass m. The robot links are 
assumed to be driven directly (without gears) by two 
servomotors with negligible dynamics . Hence the arm is 
"direct drive" and we can treat the joint torques as the 
driving signals. The dynamic equation of motion which 


relates the joint torque vector T=l I to the joint 


angle vector 0= is given by H. Seraji, 

w 


Approach to Adaptive Control of Manipulators, supra/ and 
H. Seraji, M. Jamshidi, Y. T. Kin, and M. Shahinpoor, 
Linear Multivariable C ontrol of Two-Link Robots. Journal 
of Robotic Systems, Vol. 3, No. 4, pp. 349 - 365, 1986. 

T = M (0) 0 + N (0,0) +G(0) +H(0) + mj'(0) [J(0)0 + j(0,0)0+ g] < 46) 
where the above terms are: 


a +a cos 0 s + — - oos 8 

II 2 3 1 2 

M(0) = ; N(0,0) = 


a j+ — co.e i 


-(<* Sift0 ) (0 0 ♦— 
1 2 12 2 


(V in9 ,)T 


G(0) = 


'a cos 0 4a cos (0 +0 ) 

4 IS 12 


a cos (0 40 ) 

S 1 2 


; H(0) = 


V 0 +V sgn (0 ) 
112 1 


V 0 4-V S91 (0 ) 

2 2 4 ^ 2 
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J(6) = 


-1 sin 8 -1 sin (8 +8 ) -1 sin (8 -»8 ) 

1 12 1 2 2 1 2 


1 1 cosd^l ^cos (8 ^48 2 > 1 ^co* (0^+0^) 


n 


In the above expressions, a , ...,oc are constant 
parameters obtained from the masses (m lf m 2 ) and the 
lengths (li,l 2 ) of the robot links, and (Vi,V 3 ) and 
(V 2 ,V 4 ) are coefficients of viscous and Coloumb 
frictions respectively. For the particular robot under 
study, the numerical values of the link parameters are 
mi = 15.91Kg; m 2 = 11.36Kg; li * 1 2 = 0.432m so that they 
represent links 2 and 3 of the Unimation PUMA 560 robot. 
This yields the following numerical values for the model 
parameters (H. Seraji, M. Jamshidi, Y. T. Kim, and M. 
Shaninpoor, supra ) 


a x = 3.82 


<* 2 = 2.12 

a t = 81.82 


a 3 = o.7i 
a 5 =24.06 


The friction coefficients are chosen as Vi = V 3 * 
1 .ONt .m/rad. sec-i and V 2 = V 4 * 0.5NT.m and the payload 
mass is initially m * lO.OKg. 

The joint angles 8 i (t) and 0 2 (t) are required to 
track the cycloidal reference trajectories 


ft - * j. iriJtt . 2JCt ' 

8 .. (t) - 2 + <L 3 Sin 3 . 


OS tS3 


3 < t 


e r2 (t) = -sin^f^] 0 £ t £ 3 

= y 3 < t 

so that the robot configuration changes smoothly from 

the initial posture = ~ y>0 2 = 0} to the final posture 

{0 1 = O,0 2 =+ y ) in three seconds. The joint angles are 

controlled by the feedforward and/or feedback tracking 
control scheme 

T(t)= [M (0 r )0 ! + M (0 r ) 0 , + G(8 r )] 

* r 1 (48) 

+ [f x (t) + k ^tje^tj+k jttje^t)] 
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T 2 (t) = w 21 <e r )0 , + M 22 (e r )0 4 + G 2 (e r )] 

+ [f 2 (t)+k ,(t)e 2 (t) +k ,(t)e 2 (t)] 

P V 

where M and G are defined in (46) . It is seen that each 
tracking control law is composed of feedforward and 
feedback components. The feedforward component has a 
centralized structure and is based on the manipulator 
dynamic model (46) . It is a function of the reference 
trajectory 0 r (t) and contains the inertial acceleration 

term M(0 r )0 r and the gravity loading term G(0 r ) of the 
arm itself, without the payload (i.e., m = 0) . The 
Coriolis and centrifugal term N(0,0) , the frictional 

term H(0) , and the payload term are assumed to be 

unavailable for on-line control and are not incorporated 
in the feedforward controller. The feedback controller 
has a decentralized structure and is composed of the 
auxiliary signal f(t), the position feedback term 
k p (t)e(t), and the velocity feedback term k v (t)e(t) where 
e(t) = 0 r (t) — 0 (t) is the position tracking-error in 
radians. The feedback terms are updated as (In this 
example, the a -modification was not necessary, and hence 
o 1 =o 2 = a 3 = 0) . 

f x (t) = f x ( 0 ) + Yj r 3 (t) + y t j\: 2 (t)dt - a J^f x (t)dt = jSr^t) dt ( 49 ) 

2 1 

k pl (t) = k pi ( 0 ) + a i r 1 (t)e 1 (t) +a jjr 1 (t)e J (t)dt - a jjk^ (t)dt 

12 2 

= r l (t)e 1 (t) + 10 /Jr ^tje^tjdt (50) 

k vi (t)=k vi< 0) + P .r^Je^tJ+p fr^e^dt-a f k wl (t)dt 

12 3 

= r 1 (t)e l (t) + lOj'r l (t)e i (t)dt ( 51 ) 
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r (t) = w e, (t) + w e,(t) = 3000e, (t) + 1500e , (t) (52) 

Note that the initial values of the auxiliary signal and 
the feedback gains are all chosen arbitrarily as zero. 
(The numerical values of w p and w v in (52) are large 
since the unit of angle in the control program is 
"radian." A simple trapezoidal integration rule is 
used to compute the integrals in the adaptation laws 
(49) - (51) with dt - 1 millisecond. 

To evaluate the performance of the proposed control 
scheme, the nonlinear dynamic model of the manipulator- 
plus-payload (46) and the tracking control scheme (48) 
are simulated on a DEC-VAX 11/750 computer with the 
sampling period of 1 millisecond. In order to 
illustrate the effectiveness of the proposed control 
scheme to compensate for sudden gross variation in the 
payload mass, the mass is suddenly decreased from m = 
10.0 Kg to zero at 6 = 1.5 seconds (i.e. the payload is 
dropped) while the manipulator is in motion under the 
control system operating in accordance with equation 
(48) . The results of the computer simulation are shown 
in Figure 4(i) - (ii) and indicate that the joint angles 
0 i (t) and 0 z (t) track their corresponding reference 
trajectories 0 (t) and 0 (t) vary closely throughout 

( r r 

the motion, despite the sudden payload variation. 
Figures 5(i) - (ii) and 6(i) - (ii) show the responses 
of the tracking-errors ei(t) and e 2 (t) and the control 
torques Tj (t) and T 2 (t), and indicate a sudden jump at t 
= 1.5" due to payload change. To show the feedback 
adaptation process, time variations of the auxiliary 
signal f x (t), the position gain k P i(t), and the velocity 
gain k vi (t) are shown in Figures 7 (i) - (iii) . It is 
seen that the feedback terms have adapted rapidly on- 
line to cope with the sudden payload mass change. The 
results demonstrate that the invention does not require 
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knowledge of the payload mass m and can adapt itself 
rapidly to cope with unpredictable gross variations in m 
and sustain a good tracking performance. 

7. EXPERIMENTAL RESULTS 

In this section, the tracking control system 
described in Section 3.2 is applied to a PUMA industrial 
robot for test and evaluation purposes . 

The testbed facility at the JPL Robotics Research 
Laboratory consists of a Unimation PUMA 560 robot and 
controller, and a DEC MicroVAX II computer, as shown in 
the functional diagram of Figure 8. The MicroVAX II 
hosts the RCC1 (Robot Control "C" Library) software, 
which was originally developed at Purdue University (V . 

Hayward and R. Paul, Introduction to RCCL.: A.. Robot 

Control 'C' Library . Proc. IEEE Intern. Conf. on 
Robotics, pp. 293 - 297, Atlanta, 1984, and subsequently 
modified and implemented at JPL. During the operation 
of the arm, a hardware clock constantly interrupts the 
I/O program resident in the Unimation controller at a 
preselected sampling period T s , which can be chosen as 
7, 14, 28 or 54 milliseconds. At every interrupt, the 
I/O program gathers information about the state of the 
arm (such as joint encoder readings), and interrupts the 
control program in the MicroVAX II to transmit this 
data. The I/O program then waits for the control 
program to issue a new set of control signals, and then 
dispatches these signals to the appropriate joint 
motors. Therefore, the MICROVAX II acts as a digital 
controller for the PUMA arm and the Unimation controller 
is effectively by-passed and is utilized merely as an 
I/O device to interface the MicroVAX II to the joint 
motors . 

To test and evaluate the control system described in 
Section 3, the tracking controller is implemented on the 
waist joint 8 i of the PUMA arm, while the other joints 
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are held steady using the Unimation controller. The 
waist control law is coded within the RCCL environment 
on the MicroVAX II computer. It is assumed that the 
dynamic model and parameter values of the arm are not 
available, and hence the feedforward controller is 
eliminated. The control torque for the waist joint at 
each sampling instant N is obtained from the adaptive 
PID feedback control law 

T(N) = f(N) + k p (N)e (N) + k v (N)e(N) (53) 

where e(N) = 6 i (N)-0 i (N) ± s the waist position error, 
e(N) - e(N - 1) . 

e(N) = — is the waist velocity error formed 

1 S 

in the software, and ® ,(t) is the reference trajectory 

for the waist joint . The feedback terms are generated 
by the following simple recursive adaptation laws (In 
the experiment, it was not necessary to use o~ 


modification and hence we set a=0) . 

r(N) = 30e(N) + 20e(N) (54) 

f(N) = f(N - 1) + 0.175 [r(N) + r(N - 1)] (55) 

k p (N) = k p (N — 1) +0.35[r(N)e(N) +r(N-l)e(N -1)] (56) 

k V (N) = k v (N - 1) + 2.8[r(N)e(N) + r(N - l)e(N - 1)] (57) 


where the adaptation gains are found after a few trial- 
and-errors. The sampling period is chosen as the 
smallest possible value T s * 7 milliseconds (i.e. 

sampling f s = 144H t ), since the on-line computations 
involved in the control law (53) are a few simple 
arithmetic operations. No information about the PUMA 
dynamics is used for implementation of the control 
system, and hence the controller terms are initially 
zero; i.e. f(0) = k p (0) = k v (0) = 0 . 

The PUMA arm is initially at the "zero" position 
with the upper-arm horizontal and the forearm vertical, 
forming a right-angle configuration. The waist joint 
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1 angle O^t) is commanded to change from the initial 

2 

position 0^ = 0 to the goal position in 2 

4 seconds. The reference trajectory 0 ,(t) is synthesized 

r 

5 by the cycloidal trajectory generator in RCCL as 


0 ,(t) = -j ftt - sinrct] 

T" * 


0 St £2 


g While the arm is in motion, the reading of the 

10 wa ist joint encoder at each sampling instant is recorded 

11 directly from the arm, converted into degrees and stored 

12 in a data file. The values of the auxiliary signal and 

13 feedback gains are also recorded at each sampling and 

14 kept in the same data file. Figure 9 (i) shows the 

15 desired and actual trajectories of the waist joint angle 
10 and the tracking-error is shown in Figure 9 ( ii) . It is 

1 7 seen that the joint angle B^t) tracks the reference 

18 trajectory 0 t (t) very closely, and the peak value of the 

19 tracking-error e(t) is 1 . 40 ’ . The initial lag in the 

20 0 X response is due to the large stiction (static 

21 friction) present in the waist joint. 

22 Figures 10 (i)-(ii) show the tracking performance of 

23 the waist joint for the same motion using the Unimation 

24 controller, which is operating with the sampling period 

25 of 1 millisecond f, = lKH 2 . It is seen that the peak 

26 joint tracking-error in Figure 10 (ii) is 5 . 36 *, which 

27 produces 4 centimeters peak position error at the end- 

28 e f f ector. By comparing Figures 9 (ii) and 10 (ii), it is 

29 evident that the tracking performance of the adaptive 

30 controller is noticeably superior to that of the 

31 Unimation controller, despite the fact that the 

32 Unimation control loop is 7 times faster than the 

33 adaptive control loop. The variations of the auxiliary 

34 signal f(t), the feedback gains k p (t) and k v (t), and the 

35 control torque T(t) are also shown in Figures 11 (i)- 
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(iv) . It is seen that f(t), k p (t), k v (t), and T(t) all 
start from the initial values of zero and change with 
time . 

I shall now discuss a test relating to the tracking 
performance of the adaptive controller in a different 
situation. Suppose that the configuration of the arm is 
changed smoothly from the initial zero posture to the 
final vertical posture while the waist joint is in 
motion. This effectively imposes a dynamic inertial 
load on the waist motor, as well as introducing torque 
disturbances in the waist control loop due to inter- 
joint couplings. We now specify a different desired 
trajectory for the waist angle whereby 0 : is commanded 

to change from 0 to -y in three seconds while tracking 

a cycloidal trajectory. Using the same adaptive control 
law (53), the actual recording from the waist joint and 
the desired trajectory are shown in Figure 12 (i) and the 
tracking-error is plotted in Figure 12 (ii). It is seen 
that the actual trajectory tracks the new desired 
trajectory very closely, despite the dynamic loading on 
the waist joint and the inter-joint coupling 
disturbances. The experimental results demonstrate that 
the invention's control system is not sensitive to the 
arm configuration, torque disturbances, or the desired 
trajectory. 

The following observations are made from further 
experiments on the PUMA robot: 

1. Using the Unimation controller, the tracking- 
error increases for fast motion under heavy payload. 
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For instance, when the arm is fully extended 
horizontally carrying a five pound payload and the waist 
joint is moved by 90* in 1.2 seconds, the peak joint 
tracking-error is about 9*. When transformed to the end- 
effector, this gives the peak tip error of 16 
centimeters . 

2. The rate of sampling, T s , has a central role 
in the performance of the proposed control scheme. In 
the adaptive feedback controller, the sampling rate 
determines the rate at which the feedback gains and the 
auxiliary signal are updated. Faster sampling rate 
(smaller T s ) allows higher adaptation rates to be used, 
which in turn leads to a better tracking performance. 
When the sampling rate is slow (large T s ) , the tracking 
performance is degraded, and the use of high adaptation 
gains may lead to closed-loop instability. For 
instance, for T s = 14 msec, the adaptation gains in (54)- 
(57) must be reduced to maintain stability, and this 
degrades the tracking performance. In general, when T s 
is large, the effects of sampling and discretization are 
more pronounced and the control system performance is 
degraded. Therefore, in practical implementation, it is 
highly desirable to increase the sampling rate as much 
as possible by optimizing the real-time control program 
or using a multi-processor concurrent computing system. 
In the present experimental setup, for any sampling 
period T s , about three msec is taken up by the 
communication between the Micro VAX II and the Unimation 
controller; hence for T s * 7 msec only four msec is 
available for control law computations. 
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3. The feedback adaptation gains in (54) -(57) 
should not be chosen unnecessarily high, since too high 
gains can lead to instability while producing negligible 
improvement in the tracking performance (e.g., an 
acceptable peak error of 1* may decrease to 0.1*). For 
instance, for the adaptation gains given in the above 
experiment, motion of by 90* in 1.2 seconds leads to 
instability. In this case, it is necessary to decrease 
the adaptation gains or to introduce ^-modification in 
order to ensure stability. Therefore, in general, the 
adaptation gains must be chosen to yield an acceptable 
tracking performance for the fastest trajectory in the 
experiment, i.e. "the worst case design". 

4. For very slow motions of the waist joint 
(e.g., average speed of 2 '/see), the friction present in 
the waist joint has a dominating effect and therefore 
the implemented control scheme has a poor performance. 
This is due to the fact that in this case, the control 
torque T c is comparable in magnitude to the stiction T s 
of the joint, and hence the net torque T c -T s applied to 
the joint is not sufficiently large. Therefore, for 
slow motions, it is necessary to introduce a feedforward 
controller or a friction compensation in order to 
counteract the effect of stiction. The situation is 
improved by increasing the sampling rate. 

8 . CONCLUSIONS 

A new and simple robust control system for accurate 
trajectory tracking of # robotic manipulators has been 
described. The control system takes full advantage of 
any known part of the manipulator dynamics in the 
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feedforward controller. The adaptive feedback 
controller then compensates for any unknown dynamics and 
uncertainties/variations in the manipulator/payload 
parameters . 

From the tracking point of view, it is desirable to 
set the feedback adaptation rates as high as possible so 
that the feedback controller can respond rapidly to 
variations in the manipulator dynamics or sudden changes 
in the payload. High rate adaptation, however, can 
cause instability through the excitation of unmodeled 
dynamics . The instability is counteracted in the paper 
by the addition of decay terms to the integral 
adaptation laws to yield an adaptive controller which is 
robust in the presence of unmodeled dynamics and 
disturbances . 

The feedback adaptation laws in Section 3 are 
derived under the assumption that the robot model is 
"slowly time-varying" in comparison with the controller 
terms . In theory, this assumption is necessary in order 
to derive simple adaptation laws which do not contain 
any terms from the robot model. In practice, the 
simulation and experimental studies of Sections 5 and 6 
justify the assumption, even under gross abrupt change 
in the payload. This is due to the robust nature of 
adaptive control schemes, which is discussed briefly in 

H. Seraji, A New Approach Xq Adaptive — Control — Of. 

Manipulators , supra . Nevertheless, further simulations 
and experiments need to be performed using direct-drive 
arms in fast motions to test the practical limitations 
of the simplifying assumption of slow time variation. 
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Simulation results for a two-link robot and 
experimental results of a PUMA industrial robot validate 
the capability of the invention's control system in 
accurate trajectory tracking with partial or no 
information on the manipulator dynamics. 

Finally, the control features presented herein 
can readily be extended to the direct control of end- 
effector position and orientation in Cartesian space. 
In this formulation, the controllers operate on 
Cartesian variables and the end-effector control forces 
are then transformed to joint torques using the Jacobean 
matrix (H. Seraji, An. Approach To Mu ltivariable Control 

&£. Manipulators , ASME Journ. Dynamic Systems, 

Measurement and Control, Vol. 109, No. 2, pp. 146 - 154, 

1987 and H. Seraji, Direct Adap tive Cont ro i Of 

Manipulators — In.. Cartesian Space. Journal of Robotic 
Systems, Vol. 4, No. 1, pp. 157 - 178, 1987. 

The above description presents the best mode 
contemplated in carrying out my invention. My invention 
is, however, susceptible to modifications and alternate 
constructions from the embodiments shown in the drawings 
and described above. Consequently, it is not the 
intention to limit the invention to the particular 
embodiments disclosed. On the contrary, the invention 
is intended and shall cover all modifications, sizes and 
alternate constructions falling within the spirit and 
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scope of the invention, as expressed in the appended 
claims when read in light of the description and 
drawing . 


4 / 

ROBUST HIGH-PERFORMANCE CONTROL FOR ROBOTIC 
MANIPULATORS 

ABSTRACT OF THE DISCLOSURE 

Model-based and performance-based control 
techniques are combined for an electrical robotic control 
system. Thus, two distinct and separate design philosophies 
have been merged into a single control system having a 
control law formulation including two distinct and separate 
components, each of which yields a respective signal 
component that is combined into a total command signal for 
the system. Those two separate system components include a 
feedforward controller and a feedback controller. The 
feedforward controller is model-based and contains any known 
part of the manipulator dynamics that can be used for on-line 
control to produce a nominal feedforward component of the 
system's control signal. The feedback controller is 
performance-based and consists of a simple adaptive PID 
controller which generates an adaptive control signal to 
complement the nominal feedforward signal . 
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